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The Strategic Formation of Multi-Layer Networks 

Ebrahim Moradi Shahrivar and Shreyas Sundaram 


Abstract 

We study the strategic formation of multi-layer networks, where each layer represents a different type of 
relationship between the nodes in the network and is designed to maximize some utility that depends on the 
topology of that layer and those of the other layers. We start by generalizing distance-based network formation to 
the two-layer setting, where edges are constructed in one layer (with fixed cost per edge) to minimize distances 
between nodes that are neighbors in another layer. We show that designing an optimal network in this setting 
is NP-hard. Despite the underlying complexity of the problem, we characterize certain properties of the optimal 
networks. We then formulate a multi-layer network formation game where each layer corresponds to a player that 
is optimally choosing its edge set in response to the edge sets of the other players. We consider utility functions 
that view the different layers as strategic substitutes. By applying our results about optimal networks, we show that 
players with low edge costs drive players with high edge costs out of the game, and that hub-and-spoke networks 
that are commonly observed in transportation systems arise as Nash equilibria in this game. 

Index Terms 

Multi-layer Network Formation, NP-hardness, Best Response Network, Game Theory, Nash Equilibrium, Trans¬ 
portation Networks. 


I. Introduction 

Examples of complex networks abound in both the natural world (e.g., ecological, social and economic systems), 
and in engineered applications (e.g., the Internet, the power grid, etc.). The topological structure of such networks 
(i.e., the relationships and interactions between the various nodes) plays a fundamental role in the functioning of the 
network. Early research on the structure of complex networks primarily adopted a stochastic perspective, postulating 
that the links between nodes are formed randomly |{2l, Q. An alternative perspective, driven by the economics, 
computer science and engineering communities, has argued that optimization (rather than pure randomness) plays a 
key role in network formation. In such settings, edges are formed strategically (either by a designer or by the nodes 
themselves) in order to maximize certain utility functions, resulting in networks that can be analyzed using game- 
theoretic notions of equilibria and efficiency Q-fZl- A particularly well-studied utility function is the so-called 
distance-based utility introduced in IH, Q, where the objective is to purchase edges to minimize the distances 
between all pairs of nodes in the network. Subsequent works on strategic network formation have looked at issues 
such as individual decision making, price of anarchy, and directed network formation lITOl - llTSl . 

While the existing literature on strategic network formation focuses predominantly on the construction of a single 
set of edges between the nodes, many real-world networks inherently consist of multiple layers of relationships 
between the same set of nodes. Examples include friendship and professional relationships in social networks, 
policy influence and knowledge exchange in organizational networks |[T4l . and coupled communication and energy 
infrastructure networks lITSl . ifT^ . While there has been growing research on different aspects of multi-layer networks 
in recent years |[T4l . IT71 - |[^ . the problem of strategic multi-layer network formation has started to receive attention 
only recently; aside from our initial work in fl], the paper |[2^ considers a model where each node can construct 
edges to a second high-speed network in order to minimize communication costs. 

Here, we begin a study of strategic multi-layer network formation by generalizing distance-utility network 
formation to the case where one layer (or network) is formed by optimizing the distances between nodes that 
are neighbors in another layer (or network). As a motivating example, consider the problem in |[2^ . where both the 
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physical infrastructure network and the traffic flow network between a group of cities are studied. Interpreting traffic 
flow as the weight of the connection between the endpoint cities, the objective is to design an optimal infrastructure 
network between cities with respect to the given traffic flow pattern. In the simplest case, this problem can be 
modeled as a network formation problem with a distance-based utility function where only the distances between 
specific pairs of nodes matter (i.e., those pairs with sufficiently high traffic flow between them). We address this 
class of problems by first defining a network Gi capturing an existing set of relationships between nodes, and then 
studying the formation of an optimal second network G 2 based on Gi. We call the optimally designed network G 2 
with respect to Gi the best response network to Gj. Distance-based utilities have also been used to study social 
networks (where each node is an individual and the edges indicate relationships) ||8], Q and the Internet (where 
each node represents a router and the edges indicate communication links) ifTOl . Our formulation generalizes the 
settings presented in those papers by allowing only distances between certain pairs of nodes (e.g., individuals in the 
social network or routers in the Internet) to matter when evaluating the utility of the network. For instance, in the 
case of the Internet or other communication networks, the reference layer Gi represents the virtual communication 
network indicating which pairs of nodes wish to exchange information, and the designed layer G 2 represents the 
physical communication network. 

While the best response networks have been completely characterized in the case where Gi is the complete 
network 111, ||9l, we show in this paper that finding a best response network with respect to an arbitrary graph Gi 
is NP-hard. We characterize some useful properties of the optimal networks that arise in this setting, including upper 
bounds on the number of constructed edges, lower and upper bounds on the utility of the best response networks, 
and conditions for the empty network to be a best response. These properties enable us to find best response 
networks with respect to certain specific reference networks, i.e., forests and networks with a star subgraph. 

We then use the notion of the best response network to model a scenario with multiple network designers, 
each of whom is building a different layer of the network. An example of this is when multiple transportation 
companies build their individual service networks among a group of cities, and each company prefers to provide 
service between pairs of cities that are not already covered by other companies. We capture these scenarios by 
defining a non-cooperative multi-layer network formation game where each player corresponds to a specific layer of 
the network. We develop a notion of distance-based multi-layer network formation based on strategic substitutes, 
where the presence of an edge in one layer makes it less desirable to have that edge in another layer. Despite 
the complexity of calculating best response networks, we characterize the Nash equilibrium networks that arise 
in this setting. In particular, we show that players with low costs for building edges drive out players that have 
relatively high costs, and that our framework gives rise to the “hub-and-spoke” networks commonly seen in various 
transportation systems ll24l . 


II. Definitions 

An undirected network (or graph) is denoted by G = {N,E) where N = {vi,V 2 , ■ ■ ■ ,Vn\ is the set of nodes 
(or vertices) and E C {[vi,Vj)\vi,Vj ^ N,Vi ^ vj}. The set of all possible graphs on N is denoted by G^. Two 
nodes are said to be neighbors if there is an edge between them. The degree of a node Vi ^ N is the number of 
its neighbors in graph G, and is denoted by degj(G). A leaf node is a node that has degree one, i.e., it has only 
one neighbor. A path from node vi to Vk in graph G is a sequence of distinct nodes viV 2 - ■ -Vk where there is 
an edge between each pair of consecutive nodes of the sequence. The length of a path is the number of edges in 
the sequence. We denote the shortest distance between nodes Vi and vj in graph G by dG{i,j)- If there is no path 
from Vi to Vj, we take dciiG) = 00 . The diameter of the graph G is max^. dG{i,j)- A cycle is a path 

of length two or more from a node to itself. A graph G' = {N',E') is called a subgraph of G = {N,E), denoted 
as G' C G, if C N and E' E E n {N' x N'}. A graph G' is said to be induced by a set of nodes N' C N if 
E' = E n {N' X N'}. A graph is connected if there is a path from every node to every other node. A subgraph 
G' = {N', E') of G is a component if G' is connected and there are no edges in G between nodes in N' and nodes 
in iV \ N'. 

A tree is a connected acyclic graph. For a connected graph G = {N,E), a connected acyclic subgraph T = 
{N, Et) of G is called a spanning tree of G. A spanning forest of a disconnected graph is a collection of spanning 
trees of each of its components. 

We denote the complete graph (i.e., the graph with an edge between every pair of different nodes) by G^ = 
{N,E^). We use G® = {N,(j)) to denote the empty graph. Finally, G® = {N,E^) is a star graph, which is a tree 
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graph with one node that is connected to all other nodes. The complement of graph G = {N, E) is denoted hy 
~ G = {N, ~ E), where E = E'^\E. Two graphs on the same set of nodes are said to he disjoint if their edge 
sets are disjoint. 


III. Distance-Based Utility 

A canonical problem in network formation introduced hy Jackson and Wolinsky involves distance-hased utilities 
Il8l . In this model, there is a net henefit of b{k) for each pair of nodes that are k hops away from each other in the 
network, where b : {1, 2, • • • , n — 1, oo} — M>o is a real-valued nonincreasing nonnegative function (i.e., nodes 
that are further away from each other provide smaller henefits) and 6(oo) = 0. There is a cost c G M>o for each 
edge in the network. The outcome of the network formation process is a graph G = {N, E) G G^. The utility (or 
value) of a given graph G G G^ is given hy the utility function 

u(G)= h{dG{kj))-c\E\. (1) 

Vi^Vj^N 

Vi^Vj 

In this formulation, there is an inherent trade-off faced hy the designer: adding links to a larger number of nodes 
provides a larger benefit (by reducing the distances between nodes), but also incurs a larger cost invested in links. 
An optimal (or efficient) network G satisfies u{G) > u{G'), 'iG' G G^ . 

The following resulf from [jH, shows fhaf when b{-) is a sfricfly decreasing function, fhere are only a few 
differenl kinds of efficienl nefworks, depending on fhe relafive values of fhe link cosfs and connection benefils. 
Proposition 1: O Assume fhaf b{-) is a sfricfly decreasing function. In fhe disfance-based ufilify model, 

• if c < 6(1) — 6(2), fhen fhe complete nefwork is fhe unique efficienl nelwork; 

• if 6(1) — 6(2) < c < 6(1) + {n — 2)6(2)/2, fhen fhe slar is fhe unique efficienl nelwork; 

• if 6(1) + {n — 2)6(2)/2 < c, fhen fhe emply nefwork is fhe unique efficienl nelwork. 

In fhe above proposition, whenever c is equal lo one of fhe specified upper or lower bounds, fhere will be more 
lhan one efficienl nelwork: if c = 6(1) — 6(2), fhen fhe complete nelwork and slar nelwork are bolh efficienl, and 
if c = 6(1) + [n — 2)6(2)/2, fhe slar nelwork and fhe emply nelwork are bolh efficienl nelworks wilh zero ufilify. 
Furfhermore, for fhe more general case where 6(-) is nonincreasing, fhe Ihree nelworks given by fhe above resulf 
are slill optimal for fhe corresponding ranges of costs and benefits, although they may no longer be unique. 

In the next sections, we will generalize the distance utility framework to the two-layer network formation setting. 
We will characterize the complexity of determining efficient networks in such settings and provide properties of 
such networks. We will then apply these results to study a multi-layer network formation game with multiple 
network designers. 


IV. Two Layer Distance-Based Utilities: 

Best Response Network 

In the traditional distance-based network formation problem described above, the objective is to minimize the 
distances between every possible pair of nodes. However, in many settings, one is only interested in minimizing 
distances between certain pairs of nodes. For example, consider a communications system where each node only 
wishes to exchange information with a subset of the other nodes, and the task is to design a physical network to 
provide short paths between those pairs of nodes. To handle these types of scenarios, in this section we generalize 
the study of distance-based network formation to a multi-layer setting. Specifically, suppose fhaf we have a layer 
(or graph) Gi = {N,Ei), where fhe edge sef Ei specifies a fype of relationship befween fhe nodes in N. Our 
objective is lo design anolher layer (or graph) G = {N, E) on fhe same sef of nodes, where fhe ufilify of fhe graph 
is given by 

n(G|Gi)= b{dG{hj))-c\E\. (2) 

{vi,Vj)eEi 

Nole fhaf fhe summation is only over edges in sef Ei, capfuring fhe facl fhaf only disfances befween Ihose pairs 
of nodes mailer in graph G; fhe Iradilional dislance ufilify funclion in ([T) is oblained as a special case when Gi is 
fhe complele graph. 


4 


Assume G 2 = {N,E 2 ) is a network that maximizes (jH); we say G 2 is a best response (BR) network to Gi, or 
equivalently, an efficient network with respect to the utility function 

Remark 1: The utility function ((U) does not necessarily have a unique maximizer; indeed, in many cases, there 
are multiple best response networks with respect to a given network, as demonstrated hy Example [T] helow. 

When Gi is the complete network, the best response is trivially a subgraph of Gi. However, the following 
example demonstrates that the best response network to a general network Gi does not necessarily have to be a 
subgraph of that network. 

Example 1: Consider the ring graph Gi with 6 nodes shown in Figure |l(a)[ Suppose 6(1) = c + e, for some 
small constant e > 0. Then, 

1) The utility (|2]) of Gi to itself is u{Gi\Gi) = 6(6(1) — c) = 6 e. 

2) Any subgraph of Gi with 5 edges is a path graph. This has utility 5(6(1) — c) + 6(5) = 5e + 6(5). 

3) Any subgraph of Gi with k edges, where k < 5, has utility A:( 6 (l) — c) = ke. 

Thus, when 6(5) > e, the best subgraph of Gi is the path graph with the utility given above. 

Now, the star graph shown in Figure |l(b)| has utility 26(1) + 46(2) — 5c. This is better than the path graph if 
46(2) — 36(1) > 6(5), which holds, for example, when 6(2) is sufficiently close to 6(1) and 6(2) > 6(5). Therefore, 
for utility functions that satisfy this property, no subgraph of Gi can be a BR to Gi. 

For certain benefit functions a star is not a BR either. The graph G 3 given in Figure [T(c)] has utility 46(1) + 
26(3) — 5c. This is better than the path graph if 26(3) — 6(1) > 6(5), and better than the star if 6(3) > 26(2) — 6(1). 
For instance if c = 1, 6(1) = 1.01,6(2) = 0.85,6(3) = 0.8, 6(4) = 0.2 and 6(5) = 0.1, then the graph G 3 is better 
than the star graph or any subgraph of Gi, i.e., u(G 3 |Gi) > u{G\Gi) where G C Gi or G = G 2 . In this example, 
one can verify (e.g., using a brute-force search) that G 3 is in fact a BR network to Gi. 

It is also instructive to consider the case where 6(1) = 6(2) = 6(3) > max{c, 6(4)}. In this case, the graphs 
shown in Figure |l(b)| and |l(c)| are both best response networks to Gi and have higher utility than any subgraph of 
Gi. 



The above example illustrates that BR networks to an arbitrary graph Gi are very sensitive to the relative values 
of the benefit function 6(-) and the cost c. Indeed, the shape of the entire benefit function can play a role in 
determining the best response to general graphs, whereas only the value of 6(1) and 6(2) matter when Gi is the 
complete graph (as shown in Proposition [T]). One of the main results of this paper is to formally characterize the 
complexity of finding a besf response network to a given graph. To do this, we first cast it as a decision problem 
(i.e., a question to which the answer is yes or no) as follows. 

Definition 1: Best Response Network (BRN) Problem. 

INSTANCE: A network Gi = {N, Ei), a nonincreasing benefit function 6 : {1,2, • • ■ , n — 1, 00 } —>^ K>o, an edge 
cost c G M>o and a lower bound on utility given by r G K>o. 

QUESTION: For the utility function u{-) given in equation Q, does there exist a G = {N,E) G G^ such that 

n(G|Gi) > r? (3) 

Assuming that the input size to a problem is n, if there is an algorithm that solves the problem in 0{n^) time 
(for some positive constant k), the problem is said to be in the complexity class P. A decision problem is said to 
be in the class NP if every “yes” answer has an accompanying certificate that can be verified in polynomial-time. 
Consider two decision problems A and B and assume that there exists a polynomial-time transformation from any 
instance 6 of problem B into some instance a of problem A such that the answer to 6 is “yes” if and only if answer 
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to a is “yes”. If such a transformation from B to A exists, it is called a reduction and problem B is said to be 
polynomial-time reducible to problem A. A problem A is NP-hard if for all problems B € NP, B is polynomial¬ 
time reducible to A; in particular, A is NP-hard if some other NP-hard problem B is polynomial-time reducible 
to A |[25l . An NP-hard problem that is also in the class NP is said to be NP-complete. The following theorem 
is one of our main results and shows that finding a BR with respect to an arbitrary graph with arbitrary cost and 
nonincreasing benefit functions does not have a polynomial-time solution, unless the answer to the long-standing 
open question of whether P = NP is affirmative. 

Theorem 1: BRN is NP-hard. 

We will develop the proof of Theorem [Dover the rest of this section. We will require some intermediate properties 
of best response networks, given by the following results. 

A. Some Properties of Best Response Networks 

Lemma 1: If G 2 = (A^, £' 2 ) is a BR network to Gi = then the number of edges in G 2 is less than or 

equal to the number of edges in Gi. If 6(1) > 6(2), then Gi and G 2 have an equal number of edges if and only 
if G 2 = Gi. 

Proof: We use contradiction to prove the first part. Suppose that G 2 is a BR and has more edges than Gi. 
Then 

u{G2\Gi)= Y. bidcAu.v)) - c\E2\ 

{u,v)€Ei 

< |£i|6(1)-c|£2| 

< |£i| 6 (l)-c|£i| =u(G'i|Gi), 

which contradicts our assumption that G 2 is a BR to Gi. To prove the second part, note that if G 2 = Gi then the 
number of edges in G 2 and Gi are equal. So we only need to show that when 6(1) > 6(2), if the number of edges 
in G 2 is equal to the number of edges in Gi, then G 2 = Gi. If G 2 7 ^ Gi, then there exists a (u, v) € £1 such that 
dca (u, v) > 2. Thus 

n(G 2 |Gi)= bidGAu,v))-c\E2\ 

{u,v)€Ei 

< |£i| 6 (l)-c|£i| =n(Gi|Gi), 

contradicting the assumption that G 2 is a BR to Gi. ■ 

The next lemma discusses the connectivity of BR networks. 

Lemma 2: Suppose that G 2 is a best response network to Gi and 6(1) > c. Then any two nodes that are connected 
by a path in Gi will also be connected by a path in G 2 . Specifically, if Gi is connected, then G 2 must be connected. 

Proof: Let u and v be two nodes that are neighbors in Gi. By way of contradiction assume that there is no 
path between u and v in the BR network G 2 = {N, £ 2 ). For G '2 = {N, £ 3 ) with £3 = £2 U {(n, u)}, 

n(G' 2 |Gi)-u(G 2 |Gi) > 6 (l)-c> 0 , 

contradicting the assumption that G 2 is a BR network. Now consider the case that u and v are connected through 
a path in Gi. Then there must be a path from tt to u in G 2 , since we showed that any two nodes that are directly 
connected in Gi remain connected in G 2 . ■ 

Remark 2: When 6(1) = c, the above proof can be applied to show that there exists a best response network in 
which any two nodes that are connected by a path in Gi will also be connected by a path in G 2 (although this 
does not have to be true of every best response network). 

For any integer f > 1, a subgraph H = {N,Eh) of Gi = (A^, £ 1 ) is called a t-spanner if dnix^y) < t for all 
{x, y) E El, i.e., the distance between each pair of nodes that are neighbors in Gi is not more than t in H ll26l . A 
subgraph T = {N, Et) of the graph Gi that is both a f-spanner and a tree is called a tree t-spanner. The following 
important lemma characterizes a BR to graphs that have a 2-spanner. 

Lemma 3: Suppose graph Gi = (A^, £ 1 ) has a spanning foresQ £ = {N,Ef) that is also a 2-spanner. Assume 
that 6(1) — 6(2) < c < 6(1). Then £ is a BR to Gi. 

'whenever Gi is a connected network, by a spanning forest of Gi we mean a spanning tree. 


6 


Proof: Assume that Gi has m components where m > 1. Since F is a spanning forest, \Ef\ = |A^| — m. 
Using the fact that dF{x,y) < 2 for all {x,y) G Fi, we have 

u(FlGi) = (|iV| - m)(5(l) - c) + (|Fi| - {\N\ - m))b{2). (4) 

Now assume that H = {N,Eh) is a best response network to Gi such that any two nodes that are connected in 
Gi are also connected in H. The existence of such a BR network is guaranteed hy Lemma |2] and Remark |2l Thus 
\Eh\ > — m. Also hy Lemma [T] we have \Eh\ < |Fi|. Since at most \Eh\ pairs of neighbors in Gi can be 

directly connected in H, the remaining |Fi| — \Eh\ pairs of neighbors in Gi will be at least a distance of two 
away from each other in H. Thus we have 

u{H\Gi) < lEnmi) - c) + (|Fi| - |Fj,|)6(2) (5) 

= (|Ar| - m)( 6 (l) - c) + (|Fh| - (|A^| - m))( 6 (l) - c) + (|Fi| - |Fh|)6(2) 

< i\N\ - m)( 6 (l) - c) + (|Fi| - (lA^I - m))6(2) 

= uiE\Gi). 

Thus F is a BR to the network Gi. ■ 

The next lemma provides lower and upper bounds on the utility of BR networks when 6(1) — 6(2) < c < 6(1). 
Lemma 4: Suppose that 6(1) — 6(2) < c < 6(1) and G 2 = (A^, F 2 ) is a BR network with respect to an arbitrary 
connected network Gi = (A^, Fi). Then 

|Fi|( 6 (l) - c) < u{G 2 \Gi) < (|iV| - 1)(6(1) - c) + (|Fi| - \N\ + 1)6(2). ( 6 ) 

Proof: The lower bound follows from the fact that u{G 2 \Gi) > u{Gi\Gi) = |Fi|( 6 (l) — c), by virtue of G 2 
being a BR network. For the upper bound, note that since 6(1) > c, G 2 can be assumed to be a connected graph 
(by Lemma |2] and Remark |2]) and thus IF 2 I > |A^| — 1. The rest of the proof follows the same procedure as in the 
proof of Lemma [ 3 ] with m = 1. ■ 

Remark 3: The inequalities given in the above lemma are sharp. As we will show later in this paper, a BR to a 
tree is the same tree if 6(1) > c. For a tree, the left and right hand sides of inequality ® are equal. Also, for a 
graph Gi with a tree 2-spanner T, we know that Fisa BR to Gi by Lemma |3 with utility equal to the right hand 
side of inequality ®. 

B. Proof of NP-Plardness of the BRN Problem 

We now return to the BRN problem (Definition [T]) and the claim of NP-hardness given in Theorem [T] To prove 
this theorem, we will construct a reduction from the Tree t-spanner Problem 1261, defined below. 

Definition 2: Tree f-Spanner (TtS) Problem. 

INSTANCE: A connected graph G = {N, E) and a positive integer t. 

QUESTION: Does G have a tree t-spanner, i.e., a subgraph T = [N, Et) such that \Et\ = |A^| — 1 and dxix^ y) 
for all (x, y) G El 

The TtS problem is in P for t = 2, but NP-complete for all t > 4; the complexity of the problem for t = 3 is 
still unknown ll26l . We are now in place to prove Theorem [T] 

Proof of Theorem 1: We will construct a reduction from the TtS problem to the BRN problem, which will 
then imply that the BRN problem is NP-hard. Consider an instance of the TtS problem with graph G = {N, E) 
and t = 4. Any spanning tree of G with |A^| < 5 is a tree 4-spanner which is easy to find. Thus, we assume fhaf 
\N\ > 6 . Define the corresponding instance of the BRN problem as follows. The network Gi = {N,Ei) is the 
same as the graph G, i.e., Gi = G. The benefit function 6 (-) and edge-cost c are chosen to satisfy 

6(1) > 6(2) = 6(3) = 6(4) > 6(5), 

6 ( 1 )- 6 ( 2 ) <c< 6 ( 1 ). 


Eor example c = 2, 6(1) = 3, 6(2) = 6(3) = 6(4) = 2 and b{k) = 0 Vfc > 5 satisfies fhese condifions. Einally set 

r = (|Ar| - 1)(6(1) - c) + (|Fi| - m - 1))6(2). (8) 
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Clearly we can construct the above BRN instance in polynomial time. Now assume that the answer to the instance 
of the TtS problem is “yes”, i.e., graph G has a tree 4-spanner T = {N,Et)- This means that T is a subtree of 
Gi and dT{x,y) < 4 for all {x,y) G Ei. Thus we have that 

n{T\Gi) = HdT{x,y)) + (|A^| - 1)(6(1) - c) 

{x,y)GEi\ET 

= {\E,\-{\N\-l))b{2) + {\N\-l){b{l)-c) 

= r. 

Note that we used the fact that 6(2) = 6(3) = 6(4) to go from the first line to the second line in the above equation. 
Therefore, the answer to the defined instance of the BRN problem is also “yes”. 

To complete the proof, we have to show that if the answer to the constructed instance of the BRN is “yes”, 
then the answer to the instance of the TtS is “yes”. In other words, we have to show that if there exists a graph 
G 2 = {N,E 2 ) such that 

u{G2\Gi) = ^ b{dG^{x,y)) - c\E2\ > r, 

{x,y)eEi 

where 6 (-) and c satisfy ([7]) and r is given by ([ 8 ]), then Gi has a tree 4-spanner. We claim that any G 2 with utility 
at least r must be a tree 4-spanner of Gi. 

Assume that G 2 = (A^,S 2 ) is a graph with u{G 2 \Gi) > r. Since r is equal to the upper bound of the utility 
of the BR (by Lemma |4]), G 2 must be a best response to Gi. Since 6(1) > c, by Lemma |2] we know that G 2 is a 
connected graph. Therefore, |£^ 2 | > |A^| — 1- First consider the case that 1 ^ 2 1 > |A^| — 1- Then similar to equation 
dD, we have that 

u{G 2 \Gi) < \E 2 mi) -c) + (|i?i| - \E2\)b{2) 

= (|Ar| - 1 )( 6 ( 1 ) - c) + {\E 2 \ - (|iV| - 1 ))( 6 ( 1 ) - c) + m\- \E 2 \)b{ 2 ) 

< (|iV| - 1)(6(1) - c) + {\E 2 \ - (|iV| - 1))6(2) + (lEil - 1 ^ 21 ) 6 ( 2 ) 

= (|Ar| - 1)(6(1) - c) + (|i?i| - (|Ar| - 1))6(2) = r, 

which is a contradiction. Thus consider the case that \E 2 \ = |A^| — 1, i.e., G 2 is tree. Denoting |£^2 n ii^i| = 7 , we 
have 

u(G 2 |Gi) = 7 ( 6 ( 1 ) - c) - (|A^| - 1 - 7 )c-h b{dG^{x,y)) (9) 

(x,y)£Ei\E2 

< 7 ( 6 ( 1 ) - c) - (|A^| - 1 - 7 )c -h (|T;i| - 7 ) 6 ( 2 ) 

= 7(6(1) - c) + (|Ar| - 1 - 7 )( 6 ( 2 ) - c) + i\E,\ - (|iV| _ i))6(2). 

If 7 < |A^| — 1, since 6(1) — c > 6(2) — c, by equation ® we have that 

n(G 2 |Gi) < 7 ( 6 ( 1 ) - c) + (|iV| _ 1 _ ^)(6(1) - c) + {\Ei\- (|iV| - 1))6(2) = r, 

which is again a contradiction. Therefore, \E 2 n i^il = 7 = |A^| — 1. This means that G 2 is a subtree of Gi. Now 
if there exists {u,v) G Ei such that dG^iu^v) > 4, then we have 

u(G 2 |Gi) = m - 1)(6(1) - c) + b{dGAx,y)) 

{x,y)£Ei\E2 

<{\N\-l){b{l)-c) + {\E,\-{\N\-l))b{2) 

= r, 

where the last inequality follows from the fact that 6(2) = 6(3) = 6(4) > b{d) for all d > 4. Therefore, for all 
{u,v) G El, dG 2 {'u,v) < 4 which means that G 2 must be a tree 4-spanner for the graph Gj. Thus the answer to 
the instance of the TtS problem is “yes”. This shows that the NP-hard problem TtS (for t = 4) is polynomial-time 
reducible to BRN, and therefore BRN is NP-hard. ■ 

There are certain NP-hard optimization problems (e.g., minimum vertex cover) whose solutions can be approx¬ 
imated to within a constant factor by simple greedy algorithms ll25l . The following example considers a natural 


greedy algorithm where edges are added or removed one at a time, and shows that this algorithm can produce 
results that are arbitrarily far away from the optimal network. 

Example 2: Consider a greedy algorithm where at each step, we add or remove a link that provides the highest 
increase in the utility until no further improvements can he made. The following scenarios illustrate the pitfalls of 
such an algorithm. 

Consider a reference network Gi. Suppose we attempt to huild a BR network hy starting with an empty network 
G and repeatedly adding edges. If 5(1) < c, then adding any single edge to G will result in negative utility, and 
thus the algorithm stops with the empty network. Since there can exist nonempty BR networks when 5(1) < c 
whose utility is unbounded in n (e.g., see Proposition [T]), the network produced by the above algorithm can be 
arbitrarily bad in comparison to the true BR network. 

Now suppose that we attempt to build a BR network by starting with the reference network Gi and removing 
edges one at a time. Consider the graph Gi depicted in Figure 2(a) and define c = 1, 5(1) = 5(2) = 0.5, 

b{k) = 0 for 3 < A: < n — 1. 

Starting with G\, removing any of the edges increases the utility by 5(2) — (5(1) — c). Thus any edge is a candidate 
for removal. Consider removing the edge (^ 1 ,^ 2 ) which results in network G 2 . Now no further improvements are 
possible by adding or removing a single edge. Next, consider network G 3 shown in the Figure 2(c) As we will 
show in Proposition [3] in Section IVll G^ is a best response network to Gi- We have 

(n - 1)(5(1) - c) + {n- 2)5(2) 


n^oo u(G2|Gi) 


= lim 

n—>-oo 


2(n-2)(5(l) -c) + 5(2) 


= 00 . 





Fig. 2 : Performance of a greedy algorithm. Graph Gi in (a) is the reference network. Graph G2 in (b) is the output of the 
greedy algorithm discussed above. Graph G3 in (c) is a best response to Gi. 

Note that same conclusion is reached even if we start with the complete graph, i.e., we can remove the edges in 
such a way that we end up in network G 2 . Thus this greedy algorithm can perform arbitrarily poorly in comparison 
to the optimal solution. 

An important avenue for further research is to find approximation algorifhms (and achievable approximation 
ratios) for fhe BRN problem. 


C. Comparison to Other Network Design Problems 

The problem of opfimally designing networks is classical in the computer science and algorithms literature. 
Perhaps the most common instance is the minimum spanning tree (MST) problem which is to find a spanning free 
of a weighfed graph fhaf has fhe leasf overall weigh!; fhere are greedy algorifhms fhaf solve MST in polynomial lime 
Il25l . Here, we compare fhe BRN problem fo two canonical network design problems that also attempt to minimize 
distances between pairs of nodes: the Optimal Communication Spanning Tree (OCST) problem introduced in 1271 . 
and the Simple Network Design problem (SNDP) introduced in 1281 . 

In the OCST problem, for each pair of nodes Vi,Vj € N, there is a communication requirement Vij € N. The 
goal of the network designer is to construct a tree T on the node set N such that is minimized. 

This problem is polynomial-time solvable for any set of rjj ll27l . 

In the SNDP problem, one is given an undirected graph G = {N, E) and a criterion C G N. The objective is to 
determine if there exists a subgraph G' = {N, E') of G with at most |A^| — 1 edges such that dcihj) < C. 
It was shown in that this problem is NP-complete. 

The relationships between the BRN, OCST and SNDP problems are as follows. 








• The OCST and SNDP problems explicitly constrain the number of edges in the designed network, whereas 
the BRN problem includes the cost of edges in the utility function. 

• The SNDP problem requires the designed network to be a subgraph of another given network, whereas the 
BRN and OCST problems place no such constraint. 

• The objective of the SNDP problem is to minimize the sum of distances between all pairs of nodes, whereas 
the BRN and OCST problems allow the objective function to only depend on distances between selected pairs 
of nodes (the OCST problem does this by setting = 0 for those pairs that do not wish to communicate). 

Despite the apparent similarities between the BRN problem and the OCST problem. Theorem [T] shows that the 
BRN problem is NP-hard, even though the OCST problem can be solved in polynomial-time. This increase in 
complexity is a byproduct of the additional flexibility afforded by the general nonincreasing benefit function in the 
BRN problem (as opposed to the scaled distances in the utility function for the OCST problem), which allows it 
to capture the tree-f-spanner problem as a special case. 

In the next section of this paper, we will characterize further properties of BR networks; these will allow us to 
find BR networks with respect to certain specific classes of graphs, which in furn will allow us to formulate and 
study a multi-layer network formation setting with multiple network designers. 


V. Further Properties oe Best Response Networks 


The proofs of all results in this section are given in Appendix lAl 

Lemma 5: Let G 2 be a BR network to Gi, and suppose that G 2 is not connected. Let G 2 i = (Aj, E 2 i), 
f = 1,..., /c, be the components of G 2 - Let Gu = (Aj, Eu), f = 1, 2,..., fc, be the subgraphs induced by vertex 
sets Aj on Gi. Then network G 2 i must be a BR network to Gu for f = 1, 2,... , /c. 

The following lemma considers the case when there are isolated nodes in Gi. 

Lemma 6: Let Gi = (A, Ai) and suppose u G A is an isolated node. Then v is isolated in any BR to Gi. 

The properties described above are independent of the relative values of the benefit function and edge costs. The 
following set of results provide more details of the BR networks for certain ranges of benefits and costs. 

Lemma 7: Let Gi = (A, Ai) be an arbitrary graph. 

1) If 6(1) - c > 6(2), then the unique BR network to Gi is G 2 = Gi. 

2) If 6(1) < c, then Gi is not a BR network to Gi, unless Gi is the empty network. 

3) Define 


A \EgAS,S)\ ^ 

a = max ——-^-1, 

2<|5|,SCAr |5| - 1 


( 10 ) 


where Eq^ {S, S) denotes the set of edges in Gj that have both of their endpoints in the set S, i.e., Eq^ {S, S) = 
El n {S X S). If c > 6(1) -|- a6(2), then the unique BR network with respect to Gi is the empty network. 

The parameter a is a measure of the edge density of the underlying graphj^ and thus the threshold to have the 
empty network as the best response network increases as the underlying graph becomes more dense. The following 
example illustrates the implication of a for various graphs. 

Example 3: In the following, we define |A| = n. 

• Assume that Gi = (A, Ai) is the complete graph. Then |A( 5 j(S, S)| = (^ 2 ^) for any (non-singleton) S' C A 
and thus a = in equation (fTOl) . This means that the BR to the complete graph is the empty graph for 
c > 6(1) -|- ^^^6(2), yielding part (iii) of Proposition [T] (obtained in 0) as a special case of Lemma |7] 

• Suppose that Gi = (A, Ai) is a tree. Since any induced subgraph of a tree is a forest (it is a tree when the 
subgraph is connected), we have |Ag'j(S, S)| < |S| — 1 for any non-singleton S C A. Thus 


|S|-1 


- 1 < 0 


vs C A, |S| > 2. 


This means that a = 0 (which happens for any S that induces a connected subgraph on Gi). Therefore, we 
can conclude that the BR network to a tree is the empty network when c >6(1). 


^There exist efficient algorithms to find maximally dense subgraphs in networks 1291 . 
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• Consider a cycle graph Gi = {N,Ei) with n nodesj^ Any induced subgraph of Gi on a non-singleton node 

set 5 C is an acyclic graph and thus (S', S')! < IS"! — 1. For S' = we have N)\ = n. Thus 

a = and the BR network to Gi is the empty network for c > 6(1) + ^ 5 ^ 6 ( 2 ). 

Remark 4: Note that when 6(2) = 0, Lemma |7] indicates that for 6(1) > c, Gi is a unique BR to itself (for any 
network Gi), and when 6(1) < c, the empty network is a unique best response (both Gi and the empty network 
are best responses with utility 0 when 6(1) = c). Thus, in the rest of the paper, we will assume that 6(2) > 0. 

In the next lemma, we consider the case that we have nodes with degree one in the graph. 

Lemma 8: Let Gi = {N,Ei), and suppose u € A^ is a leaf node. Define the induced subgraph of Gi under the 
node set N \ {u} as Gn = {N \ {ulji?!!) (i.e., the graph obtained by removing node v and its incident edge). 
Then a BR to Gi can be obtained by first finding a BR to Gn and then adding v as an isolated node if 6(1) < c, 
or adding v together with a single edge to its neighbor in Gi if 6 ( 1 ) > c. 

The above lemma provides the following method to simplify the task of finding a besf response nefwork. Given 
a graph Gi, we recursively remove nodes of degree 1 until we are leff wifh a graph where all nodes have degree 
two or larger (this is known as peeling the graph, and the resulting subgraph is known as a 2-core ||30l). A best 
response to the 2 -core can then be found using whatever means necessary, and then the removed nodes can be 
recursively added back as isolated nodes (if 6 ( 1 ) < c), or with the single edge that was removed (if 6 ( 1 ) > c). 

VI. Best Responses to Specific Networks 

We will now apply the above results to characterize best responses to acyclic networks and networks with a 
star subgraph. The latter models, for example, sensor or communication networks where one or more base stations 
or fusion centers wish to communicate with all nodes, while the other nodes only need to communicate locally 
amongst themselves. The proofs of the following two propositions are provided in Appendix iBl 

Proposition 2: Let Gi = {N, ) be a forest. 

. If 6(1) < c, the empty network is the unique BR to Gi. 

. If 6(1) > c, then G 2 = Gi is a BR network to Gi. 

• If 6(1) = c, the empty network and G 2 = Gi are both BR networks to Gi. 

• For 6(1) > max{6(2),c}, the unique BR to Gi is G 2 = Gi. 

Proposition 3: Let Gi = {N, ) be a graph that has a star subgraph centered at node v ^ N. 

• If 6(1) — 6(2) > c, then Gi is the unique BR to Gi. 

• If 6(1) — 6(2) < c < 6(1), then the star network centered at node u is a BR network to Gi. 

• If 6(1) < c, one of the following networks is a BR to Gp. 

1) A star network on N with center at node v. 

2) A network where one component is a star and all other components are isolated nodes. 

3) The empty network. 

VII. Multi-Layer Network Formation Game with Strategic Substitutes 

In the previous sections, we considered the scenario where a network designer chooses an optimal graph (or layer) 
G 2 with respect to a given graph Gi. In this section, we will build on this formulation to consider a scenario where 
multiple network designers are building layers, with a utility for each layer that depends on the structure of that 
layer and the layers constructed by the other designers. This models, for instance, different mail and courier service 
companies designing their individual networks to service their customers, or different transportation networks (air, 
rail, bus) arising between a set of cities |[ 2 TI . lf24l . ISH- We start by defining an m-player game where each player 
corresponds to one of the layers. 

Definition 3: A Multi-Layer Network Formation Game has a set of m players P = {Pi, P 2 ,..., Pm }■ The strategy 
space for each of the players is defined to be G^ where N = {vi,V 2 , ■ ■ ■, Vn}, i-e., the set of all graphs on node 
set N. For each z € {1, 2,... , m}, let Gi = {N, Ei) G G^ denote the action of player Pi. The utility of player Pi 
is given by a function Ai : G^ x G^ x • • • x G^ —M, where the argument is the action of the player for 
1 < j < m. 

^A cycle graph with n nodes consists of only one cycle of length n. 


11 


We will use G-i to denote the vector of actions of all players except player Pi, and use Ai{Gi, G-i) to denote 
the utility of player Pi with respect to the given vector (Gi, G 2 ,..., Gm)- Based on the definition of the game, we 
say that a vector of networks (Gi, G 2 ,..., Gm) is a Nash equilibrium if and only if Gj € arg max^j Ai{G, G-i) 
for all f € {1, 2,, m}. In this case, Gj is said to he a BR network to G-i with respect to the utility function Ai. 

The characteristics of the game and the optimal strategies for each player will depend on the form of the utility 
functions A^. Here, as a starting point for studying such games, we will focus on distance-hased utilities (thereby 
building on our results from the first part of the paper). The reference networks for the distance-based utility function 
for each player will depend on the networks constructed by the other players. In the remainder of the paper, we will 
explore functions that view different layers of the network as strategic substitutes, where the presence of a link in 
one layer makes it less desirable for that link to appear in another layer; this captures the notion that the different 
network layers are attempting to fill gaps in connecfivify leff by fhe ofher layers 0 As a mofivafing example, consider 
competing fransportafion companies offering services between a common set of cities. Suppose that for economical 
reasons, each company would prefer to design its transportation network to provide short routes between those 
cities that are not directly serviced by any other company. In other words, each company designs its network with 
respect to the complement of the transportation networks provided by all other companies. If we impose further 
structure on such games by assuming distance-based utility functions, we obtain the game defined below. In fhe 
following definition, for a set of graphs Gj = {N,Ej), j = 1,2,... ,m, on a common set of nodes, we use the 
notation U^L^Gj to indicate the graph G = {N,LI)p=iEj), and nJTj^Gj to indicate the graph G = {N,r\)p=iEj). 

Definition 4: The game in Definition |3] is said to be a Multi-Layer Network Formation Game with Strategic 
Substitutes and Distance-Utilities if the utility functions are of the form 

Ai{Gi,...,Gm)=ufiG,\ ~ (u”Li,,.^,G,)) (11) 

= XI bi{dGfix,y)) - Ci\Ei\, 

where the function Ui is defined in ([2]); fhe benefif functions 6j(-) are nonnegafive, nonincreasing and satisfy 
bi{oo) = 0, and all costs c* are positive. The benefit functions and costs can be different for the different players. 

It is clear from the definition of the game that (Gi, G 2 ,..., G^) is a Nash equilibrium if and only if for 
all 1 < i < m, Gj is a BR network with respect to ~ {U'JLi j^iGj) for the utility function dD. Although we 
showed in Theorem [T] that finding a BR network with respect to this utility function is NP-hard in general, we 
now show that certain insights can nevertheless be obtained in the multiplayer setting (regardless of the number 
of nodes and players). To develop our results, we partition the set of players P into three sets: high-cost players 
Sh = {Pi € P\ci > bi{l)}, medium-cost players Sm = {Pi G > P > bi{l) — bi{2)} and low-cost players 

Sl = {Pi € P\bi{l) — hi{2) > Ci}. We start by considering the case where the game contains low-cost players. 

A. Games Containing Low-Cost Players 

Proposition 4: Suppose |5£,| > 1. Then in every Nash equilibrium, every player in Sh chooses the empty network. 
Furthermore, any vector of disjoint networks (Gi, G 2 ,..., Gm) forms a Nash equilibrium when {Gk\Pk £ Sm} is 
a set of disjoint forests and Ujg^^Gj = ~ Uig5„Gj. 

Proof: Let (Gi, G 2 ,..., Gm) be any vector of networks in Nash equilibrium. Since there exists at least one 
player Pi whose edge cost satisfies c* < 6j(l) — 6j(2), fhe Nash equilibrium vector must satisfy U^^Gj = G'^, where 
G^ is the complete network. To see this, suppose that the union of the graphs is not the complete network; then 
there exists some edge {u, v) that does not appear in any network, and thus appears in the complement of the graph 
j^-Gj. By Lemma |7J the BR to ~ j^-Gj with respect to player Pfs utility function is ~ 
and thus the edge {u, v) appears in graph Gj, contradicting the fact that it does not appear in the union of all the 
graphs. 

Next, note that since U^^Gj = G'^, for any player P^ € P, the graph Gk = {N,Ek) is a BR to the graph 
G'^ \ {U™ ij,^i^Gj} C Gk- By Lemma m a BR to a graph cannot be a strict superset of that graph, and thus we 
have that Gk is a best response to itself with respect to the utility function of player Pk. Now if Pk € Sh, we 

"'One can also consider a strategic complements version of this class of games where each player wishes to provide short paths between 
those pairs of nodes that share an edge in each of the other layers. The analysis of such games is relatively straightforward and thus we 
focus on strategic substitutes in this paper. 
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know from Lemma |7] that Gk must be the empty network, completing the first part of the proof. For the second 
part, note that for any vector of networks satisfying the given properties. Proposition |2] and Lemma |7] indicate that 
a best response to Gk is indeed Gk for Pk € Sm ^ Sl, completing the proof. ■ 

The above result shows that the presence of a player with low edge costs (relative to its own benefit function) 
guarantees the existence of a Nash equilibrium in the game, and furthermore, such low-cost players drive players 
with sufficiently high edge costs out of the game; the proposition provides the threshold for costs at which this 
occurs (namely bi{l) < Ci). Players with medium edge costs, on the other hand, can obtain certain nonempty 
networks in equilibrium, and the players with low edge costs split all of the remaining edges amongst themselves. 

We now study the situation where there are no low-cost players in the game (i.e., = 0). We start by considering 

games that contain only high-cost players. 


B. Games Containing Only High-Cost Players 

Suppose P = Sh (i-E-, Sl = Sm = 0)- For each player Pi G P, define fhe index ki as 

ki = min|f G N | c* < bi{l) H-^6j(2)| . 

Since Ci > bi{l), we have ki > 3 for all Pi G P. If ki > n, by Lemma|7l the empty network is a BR of player Pi to 

any set of networks G-i (since a in (fTOl) satisfies a < for any reference graph). Thus wifhouf loss of generality, 

assume that all players have 3 < ki < n and players are sorted according to their ki, i.e., ki < k 2 < ■ ■ ■ < km < n. 
We will now partition the set of players P into different sets. 

Define fhe index zi as 

ii = max {z G {1, 2,... , m} \ ki < n — i + 1} . 

Nexf, define 

i 2 = max {i G {1, 2,... , ty — 1} | fci < fi — z -|- 1} , 

Z 3 = max {z G {1, 2,... , Z 2 - 1} | fc* < Z 2 - * + 1} , 

; ( 12 ) 

ir = max {z G {1, 2,... , z^-i — G) \ki < P-i — z -|- 1} , 

where ty satisfies ir < ki (so that no further sets of this form can be defined). 

The above indices satisfy 1 < ty < Zr-i < • • • < zi < m. Partition fhe sef of players and nodes as follows 


Hr — {Fl) • • • ) Pi,. }) Lr — ) • • • 5 } 

Hi 1 = , • • • , Pir-1 }) —1 — {^ir-l-1) • • • 5 '^ir-1 } 


: (13) 

Hi {Fj2-i-i } ■ ■ ■ 1 Hi ^}, Vi {zzjj+i) • • •) Vi ^}. 

Also define Hq = {Pq+i, F’q+2, • • •, Pm} and Vb = {vii+i,Vi,+2, • • •, Vn}- 

Proposition 5: For each player R G Hi (for 1 < Z < r), define fhe nefwork Gj fo be fhe sfar nefwork cenfered on 
node Vj with peripheral nodes Uj'qVj, where Hi and Vt are defined as in equation (fTSl) . For each player Pj G Hq, 
define Gj to be the empty network. Then the set of networks (Gi, G 2 , ■ ■ ■, Gm) forms a Nash equilibrium. 

The proof of the above proposition is given in Appendix O The following example illustrates the structure of 
the Nash equilibrium specified by the above proposition. 

Example 4: Suppose that there are 11 nodes and 9 high-cost players with Zcj = 3 for 1 < z < 5, /cg = 4, kj = 5 
and ks,kQ > 5. From the equations in (fT^ . we get zi = 7, Z 2 = 5 ,Z 3 = 3 and = 1. Figure [3] demonstrates 
the networks of players Pi, P2, P 4 and Pq in the Nash equilibrium defined in Proposition |5] Player P 3 has a 
similar network to player P2 (except that the star of her network is centered on vs). Players P 5 and P 7 have similar 
networks to that of P 4 and Pq, respectively (the only difference being that player P 5 has a star centered on zzg, and 
Pj has a star centered on vj). Players Pg and Pg each have the empty network. 

Despite the stylized nature of the multi-layer network formation game in Definition |4l it is of interest to note that 
the “hub-and-spoke” networks that arise in the above Nash equilibrium are predominant in real-world transportation 
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V4 V3 1/2 Fl Vo 


(a) Partition of the nodes. 



(d) Gi 


(e) Ge 


Fig. 3 : A multi-layer network formation game considered in Example^ with 9 high-cost players and 11 nodes. Nodes are 
partitioned into 5 sets as shown in Figure \ 3 (a)\ based on the characteristics of the players. Each node in each of the sets 
^1:1^2, V3, V4 will be chosen by a different player as the center of a star subgraph in the Nash equilibrium. The Nash equilibrium 
networks of players Pi, P2, Pi and Pq are shown in \ 3 (b)\ \ 3 (c)\ \ 3 (d)\ and \ 3 (e)\ respectively. The networks of players P3, P5 
and P7 are not shown; they have stars centered on V3, V5 and vt, respectively, with the same peripheral nodes as P2, Pi and 
Pq, respectively. Players Pg ond Pg choose the empty network. 


systems (airline networks, in particular) 1 ^ . 1241, ED- While previous work has shown that such networks are 
optimal in the single-layer setting (e.g.. Proposition [U 0), our analysis shows that these structures also arise when 
players selfishly optimize their individual networks in competitive environments. We will now consider games with 
a mix of medium-cost and high-cost players, and show that such structures also arise as a Nash equilihrium in that 
setting. 

C. Games With Medium and High-Cost Players 

Proposition 6: Suppose that Nl = 0, and assume without loss of generality that the first p players in P 
are medium-cost players, with 1 < < n. For j G {1,2,... ,/r}, define the network Gj to he the star net¬ 

work centered on node Vj with peripheral nodes {uj+i, Uj+ 2 , • • •, Un}. For the set of high-cost players Sh, let 
G/i+ 2 ) ■ ■ ■) Gm) be the Nash equilibrium networks on node set (u^+i, u^+ 2 , ■ ■ ■ j Vn} defined in Proposifion[5] 
Then fhe sef of nefworks (Gi, G 2 ,..., Gm) is a Nash equilibrium. 

Proof: In the proof, we will use the fact that each network Gj, 1 < j < m, only contains edges from node Vj 
to nodes with index larger than j. For each player Pj, let Gj^ref — UI^i ifj Gj be the union of the networks of 
the other players. 

Consider a medium-cost player Pj, where j € (1,2,... ,p}. Since all players with index smaller than j are 
medium-cost players, for each node v; with i < j, Gj^ref contains an edge from node v; to Vk for all k > 
i. Furthermore, Gj^ref contains no edge from Vk to Vj for any k > j. Thus, in the network ~ Gj^ref-> nodes 
vi,V 2 , - ■ ■ ,Vj-i are isolated, and there is an edge from Vj to each node Vk with k > j. By Lemma 0 the isolated 
nodes in ~ Gj^ref remain isolated in the BR; applying Proposition [3j a star network centered at Vj with edges to 
{vjj^i ,..., Vn} is a BR with respect to ~ Gj^ref- Thus, Gj is a BR to ~ Gj^ref- 

Now consider a high-cost player Pj, where j G p-\-2,..., m}. Arguing as above, nodes vi,V 2 , ■ ■ ■ ,v^ are 

isolated in the network ~ Gj^ref- Thus by Lemma 0 those nodes remain isolated in the BR to ~ Gj^ref- Since this 
is true for all high-cost players, we can remove the nodes vi,V 2 , ■ ■ ■ ,v^ from consideration, and focus on showing 
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that the subgraph of Gj induced hy the node set {u^+i, u^_|_ 2 ,..., i;„,} is a BR to the graphs (G^+i, G^+ 2 , • • •, Gm) 
on that node set. This is true hy construction, and thus the given set of networks is a Nash equilihrium. ■ 

Example 5: Consider a game with 13 nodes, 2 medium-cost players (Pi and P 2 ) and 9 high-cost players 
(P 3 , • • • ,Pii). Assume that the 9 high-cost players are the same as the high-cost players in Example |4l Based 
on Proposition |3 each of the medium-cost players Pi and P 2 will have a star network centered on node vi and 
V 2 , with peripheral nodes V \ {ui} and V \ {vi,V 2 }, respectively. These networks are shown in Figure [4^b)] and 
|4(c)| respectively. The networks of the remaining players (which have high costs) have the same structure as in 
Example |4] with two extra isolated nodes, vi and V 2 - Once again, we see that huh-and-spoke networks arise as a 
Nash equilihrium in this setting. 



M Vi V 3 V 2 Vi Vo 


(a) Partition of the nodes. 



(b) Gi 


(c) G 2 


Fig. 4: Figure [4(a)| demonstrates the partition of the set of nodes into 6 sets. The first set (denoted M) contains nodes that 
will form the centers of the star networks chosen by the medium cost players Pi and Pj. These star networks are depicted 
in Figures 4(b)\ and 4(c)\ The networks of the remaining high-cost players have the same structure as the networks shown in 


Figures \3(bj to \3(e) 


with vi and V 2 as isolated nodes. 


The following corollary immediately follows from Propositions IH |5] and 0 

Corollary 1: The multi-layer network formation game with strategic substitutes and distance-utilities has a pure 
Nash equilibrium for any set of players. 


VIII. Summary and Future Work 

In this paper, we introduced and studied the problem of strategic multi-layer network formation. We generalized 
distance-based network formation to multi-layer networks, and showed that the problem of finding an optimal 
network in this setting is NP-hard. We characterized certain properties of optimal networks, and found the optimal 
networks for certain special cases of reference graphs. Next, we formulated a multi-layer network formation game 
where each player builds a different layer of the network. When the layers are viewed as strategic substitutes, we 
showed that the Nash equilibria of the game exhibit certain natural characteristics. Specifically, the presence of low- 
cost players pushes high-cost players out of the game, and huh-and-spoke networks arise in the Nash equilibrium 
when there are no low-cost players. 

There are many interesting avenues for further research. (1) Deriving approximation algorithms with provable 
performance guarantees is a natural approach to dealing with the inherent complexity of finding opfimal nefworks; a 
deeper invesfigafion of fhe connections befween f-spanners and fhe besf response nefwork design problem mighf lead 
fo such algorifhms. (2) Our inifial simulations show fhat sequenfial best response dynamics converge to the Nash 
equilibria that we identified in this paper; providing formal proofs of convergence and understanding other non- 
simultaneous variants (such as Stackelberg games) is an important avenue for research. (3) While we have focused 
on distance-based utilities with strategic substitutes in this paper, it would also be of interest to study other classes 
of utility functions in the multi-layer network formation game. (4) A mechanism to incorporate stochasticity and 
partial information into the network formation process would be of value in modeling and gaining further insights 
into the formation of realistic networks. 
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A. Proof of Lemma 0 

Proof: Consider the utility of network G2 with respect to Gi- Since there are no edges between the components 
in G2, for any (rt, v) € Ei with u and v in different components of G2, dc^ (u, v) = cx). Thus 'f 2 {u v)£Ei ('^j '^')) = 
Yli=i '^{uv)£Ei ^(.dG2i{uiv)), and the utility function can he written as 

u{G 2\G,)= bidG2{u,v))-c\E2\ 

(u,v)gEi 

= b{dG 2 Au,v))-c\E 2 i\ 

*=1 \{u,v)£Eii 

= uiG 2 i\Gu) + --- + u{G 2 k\Gik). 

Now, if G2i is not a BR to Gu for some i € {1,2,.. . ,k}, replace it with a BR. This will increase the utility, 
contradicting the fact that G 2 is a BR. ■ 

B. Proof of Lemma^ 

Proof: Let G 2 = {N,E 2 ) he a BR network with respect to Gi, and suppose hy way of contradiction that 
V not isolated in G 2 . If u is a leaf node in G2 (i.e., it has a single neighbor), then the edge incident to v is not 
used in any of the shortest paths between nodes in A" \ {u}. Removing that edge increases the utility of G 2 by c, 
contradicting the fact that it is a BR. 

Now suppose that v has two or more neighbors in G2, and denote those neighbors by the set J = {vj^, Vj^ } C 
N \ {u} with I > 2. Construct a new network G 3 = (A, E^) with 

E 3 = {E 2 \ {iv,vj,),iv,vj,),...,{v,vj,)})u{ivj,,vj,),{vj,,vj,),...,{vj,,vj,)}, ( 14 ) 

i.e., we remove the I edges from v to its neighbors and add edges from Vj^ € J to the other nodes in J. This results 
in a net removal of at least one edge from the graph. Suppose that the shortest path between some pair of nodes 
in A \ {u} passed through u in G 2 ; the shortest path now passes through Vj^ in G 3 , and is at least as short as the 
original shortest path. Thus n(G 3 |Gi) > u{G 2 \Gi) which contradicts the assumption that G 2 is a BR network to 
Gi. Therefore, v must be an isolated node in G 2 . ■ 


C. Proof of Lemma^ 

Proof: In order to prove the first property, assume by way of contradiction that G 2 is a BR network and 
G 2 7 ^ Gi. Since 5(1) > 5(2), by Lemma [T] we know that the number of edges in G 2 is less than in Gi. So there 
are vertices u and v such that {u,v) G Ei and dG^iu^v) > 1. Adding the edge {u,v) to E 2 increases the utility 
by at least 5(1) — c — 5(2) > 0 which contradicts the assumption that G 2 7 ^ Gi is a BR network. Therefore, the 
BR network must be equal to Gi. 

For the second property note that if G 2 = Gi 7 ^ f, then u(G 2 |Gi) = |Ai|(5(l) — c) < 0 due to the assumption 
that 5(1) < c. Thus it must be the case that G 2 7 ^ Gi, or Gi is the empty network. 

Finally in order to prove the third property, consider an arbitrary graph Gi = (A, Ai) with n nodes. By way of 
contradiction assume that G 2 7 ^ </) is a BR network with respect to Gi. Let G 21 = (Ai, A 21 ) be a component of 
network G 2 with 1 < |Ai| < n. By Lemma [5J we know that G 21 must be a BR to the subgraph induced by the 
node set Ai on Gi, which we denote by Gu = (Ai, An). Thus 


r(G' 2 i|Gii) < |A2i|(5 (1) - c) + (|An| - |A2i|)5(2) 

= |A2 i|(5 (1) - c + a5(2)) + (|An| - |A 2 i|(l + a))5(2) 

(\Eu\ 

\\E2i\ 


|A 2 i|( 5 (l)-c + a5(2)) + |A 2 i| 


(1 + a) 5(2). 


(15) 
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Due to the assumption that c > 6(1) + ab{2), the first term in (fTSl) is negative. Also, we have that 

|j^n| ^ \En\ _ \EGdNi,N,)\ ^ \EgAS,S)\ ^ 

|^2l| “ |iVl| - 1 ~ 2<\S\,SCN IS*! - 1 

The first inequality above follows from the fact that G 21 is a component and thus has at least |A^i| — 1 edges. Thus 
the second term in equation (fTSl) is nonpositive. Therefore, u(G' 2 i|Gii) < 0 which is a contradiction. As a result 
G 21 (and thereby G 2 ) must be the empty network. ■ 

D. Proof of Lemma^ 

Proof: Let the neighbor of v in Gi be denoted by u, and assume that network H = {N,Eh) is a BR to 
network Gi. We reason as we did in the proof of Lemma [6l with a few additional details. 

Consider the case that 6(1) < c. Suppose that node v is not isolated in H, and let J = {vj^,Vj^,..., } C A^\{u} 

be the neighbors of u in L6. If f = 1 (i.e., v has a single neighbor in H), the edge {v,Vjf) is not used in any of 
the shortest paths between nodes in \ {u}. Removing that edge saves a cost of c, and loses at most a benefit of 
6(1) (due to the loss of the path from u to u in H). Since 6(1) < c, the resulting graph has utility at least as large 
as H. 

Now suppose I > 1. Construct the new network Hi = {N,Ehi) with edge set 

Eh, = (Eh \ {(^i, Vj,), {v, ..., {v, Vj,)}) U {{vj ,, Vjf), {vj, {vj,, Vj,)}. (16) 

In other words, we remove all of the incident edges from v in H and add edges from each node in J \ to 
Vj^. This saves at least one edge, and dH,{x,y) < dH{x,y) for all {x,y) G Ei. Thus, the only drop in utility in 
graph Hi arises from the loss of the path from node v to u. Again, since 6(1) < c, the graph Hi has utility at least 
equal to the utility of the network H and thus Hi is also a best response. The above two cases show that when 
6 (1) < c, there exists a best response where the leaf node v is isolated. 

Now consider the case where 6(1) > c. Then by Lemma|2]and Remark|2j there exists a BR network H = {N, Eh) 
containing a path from v to u. If u is a leaf node in H, it is straightforward to show that there exists a BR network 
H' where v is connected to u. Thus suppose v is connected to the node set J = ... ,Vj^) N \ {u} in 

H, with I > 2. Construct a new graph H 2 = {N, Enf), where Eh^ — Eh, U {(u, n)} with Eh, as defined in (IT^ . 
Arguing as above, the utility of H 2 is at least as high as the utility of H, and thus H 2 is a BR to Gi. Since the 
edge (u, u) cannot be in the shortest path between any pair of nodes in \ {u}, we see that the subgraph of H 2 
induced by A^ \ {u} must be a best response to the corresponding subgraph of Gi. This proves the result. ■ 

Appendix B 

Proofs for Section IvT] 

A. Proof of Proposition |2] 

Proof: When 6(1) < c, we use part 3 of Lemma |7] Following the same argument as in Example [3] for trees, 
we have a = 0 for Gi. Thus the unique BR network to a forest is the empty network when 6(1) < c. 

For 6(1) — 6(2) > c, the unique best response to any network is the same network by the first part of Lemma |7] 
For 6(1) — 6(2) < c < 6(1), note that Gi is a 2-spanner forest of itself, and thus Gi is a BR to itself by FemmaO 
proving the second statement. Since this BR has a utility of zero when 6(1) = c, the empty network is also a BR 
for this value of c, proving the third statement. 

Finally, we prove the uniqueness of the BR when 6(1) > max{6(2),c}. If Gi has r connected components, then 
lE'il = |A^| — r. By Femma|2l we must have |£^ 2 | > |.^| — t. By Femma[T] we know that 1^21 < \Ei \ = |A^| — r. 
Thus 1^21 = l^-il and since 6(1) > 6(2), we have G 2 = Gi. ■ 

B. Proof of Proposition 13 

Proof: The first statement is a direct result of Femma |7] 

In order to prove the second statement we use Femma [3 Let G* be the star network centered at node v. Since 
G^ is a 2-spanner tree of Gi, it is a BR to Gy. 
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Next, we prove the third statement. Define as the star network centered at node v. By equation (|2]), we have 

u{G^\Gi) = (|iV| - 1)(6(1) - c) + (|Si| - (|iV| - 1))6(2). (17) 

Now assume that G 2 = {N,E 2 ) is a BR network. Using the same argument as in equation (l5]l, we have 

u{G 2 \Gi) < \E 2 \ib{l) -c) + (lEil - \E2\)b{2). (18) 

Using equations (fTT]) and (fT^ we obtain 

u(G*|Gi) - u{G 2 \Gi) > {\E 2 \ - (lA^I - 1))(6(2) - 6(1) + c). (19) 

According to the assumption of the Proposition, c — 6(1) > 0 and thus the right hand side of equation ([T^ is 
nonnegative for all \E 2 \ > |A^| — 1. Therefore, the utility of G^ with respect to Gi is as high as any other connected 
network. 

Thus assume that G 2 is a non-empty disconnected network. Suppose that it has 7 components G 2 k = {Nk,E 2 k) 
for k G {1,2,-•• , 7 }. Denote hy Gik = {Nk,Eik), k G {1,2,-•• , 7 }, the subgraphs induced by Nk on Gi. 
Without loss of generality, let u G iVi. Then, since Gu contains a star subgraph (centered on v), and G 21 is a BR 
to Gii (by Lemma [5]) and connected, we can take it to be a star by the above argument. Next, we aim to show 
that there exists a BR (constructed based on G 2 ) such that all of the components are isolated nodes except G 21 . 

Suppose that some component of G 2 (not containing v) has more than one node and take this component to be 
G 22 without loss of generality. We know that G 22 is a BR to G 12 based on Lemma [5] Arguing as in equation ([5]), 
we have 

u{G 22 \Gi 2 ) < \E 22 mi) - c) + (|,Ui 2 | - |^ 22 |)i^( 2 ). ( 20 ) 


If G 22 has zero utility, we can replace it by the empty network and subsequently, we have the result. Thus assume 
by way of contradiction that it has some positive utility. Therefore, the right hand side of equation (l20l) is positive. 
Since G 22 is a connected network, 17^221 > |A" 2 | — 1- Hence 


|7i^l2| - 17^221 ^ |^12|-(|7V2|-1) ^ Ct') - (l^2| - 1) 

1 ^ 22 ! “ |iV 2 |-l “ |iV 2 |-l 

Using the assumption that the right hand side of inequality (l20b is positive and by 


|7V2| - 1. (21) 

inequality (l2T]) . we have that 


0 < \E 22 mi) -c) + (|,Ui 2 | - 1^221)6(2) 
= IE 22 I ((Ml) - C) + 

<1^221 (Ml)- c+(|iV2|-i)M2)). 


Now consider a graph G 2 obtained by removing all edges of G 22 and connecting all of its nodes to node v. Since 
6 ( 1 ) — c + (|iV 2 | — 1 ) 6 ( 2 ) > 0 we have, 

u{G2\Gi) > Y,u{G2i\Gu) + |7V2|(6(1) - c) + |77i2|6(2) 

> Y.u{G2i\Gu) + |7V2|(6(1) - c) + 1771216(2) - (6(1) - c + (|Ar 2 | _ i)6(2)) 

= Y.u{G2i\Gu) + (|7V2| - 1)(6(1) - c) + (|77 i2| - (|7V2| - 1))K2), (22) 

27^2 

where the first inequality follows from the fact that the induced subgraph of A^i U N 2 on G 2 is a connected network 
and we neglect the benefit (if any) from indirect connections between nodes in A^i \ {u} and N 2 . The second term 
in the first inequality captures the direct benefits and costs of the IA 2 I edges from nodes in N 2 to v, and the third 
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term captures the benefits due to each pair of nodes in N 2 having a distance of 2 from each other in G 2 (via v). 
Next, note that 

u{G22\Gi2) < |-E22|(^(1 ) — c) + (|-E'i2 | — \E22\)b{2) 

= (|iV 2 | - l)(6(l) - c) + i\E 22 \ - (|iV 2 | - 1))(6(1) - c) + {\Eu\ - \E22\)bi2) 

< (|iV 2 | - l)(6(l) - c) + (|£: 22 | - (|A^ 2 | - 1))6(2) + (IS 12 I - 1^221)6(2) 

= (|iV 2 | - 1)(6(1) - c) + i\Eu\ - (|A^ 2 | - 1))6(2). (23) 

Substituting inequality (1231) in inequality (l22l) . we have that 

u{G2\Gi) >Y,^{G2i\Gu) + (|iV2| - 1)(6(1) - C) + (|i?i2| - (|iV2| - l))b{2) 

i^2 

> Y,u{G2i\Gu) + u{G22\Gi2) = u{G2\Gi). 

i^2 

However this is a contradiction to the assumption that G 2 is a BR to Gi. Thus all of the nonempty components of 
G 2 (except G 21 ) must have zero utility and therefore, we can replace each of them by the empty network. ■ 

Appendix C 

Proof of Proposition [5] 

To prove Proposition |5l we will first need the following intermediate result. 

Lemma 9: Let 6(1) < c. Consider network G = {N,E) with components Gi = {Ni,Ei) for 1 < i < r 
{N = Ul^-^^Ni and E = Ul^^Ei). Assume that every induced subgraph of G has a 2-spanner forest. Then every 
BR of network G is composed of a BR to each component of G. 

Proof: Let E = (Ul^^ Aj, Ep) be a BR to G. Suppose by way of contradiction that E contains a non-empty 
component Fi = {W,R) with nodes from p different Aj where p > 2. Let Gp^ = {W,Epf} denote the induced 
subgraph of lA on G and T be a 2-spanner forest of Gp-^. The spanner forest T has q components where q > P- 
Also note that |i?| > |1A| — 1 > |1A| — q. Then we have 

u{Fi\Gp,) < |i?|(6(l) - c) + {\Epf - \R\)b{2) (24) 

= {\R\ - (|1A| - q)){b{l) - c) + (|1A| - q)m - c) + {\Epf - \R\)b{2) 

< (|ii| - (|1A| - q))b{2) + (|1A| - q)m - c) + {\Ep, \ - |i?|)6(2) 

= (|1A| - q){b{l) - c) + {\Ep, I - (|1A| - q))bi2) 

= uiT\Gp,), 

where the first inequality comes from the fact that at most \R\ pairs of nodes that are neighbors in Gp^ have direct 
connections in Fi and the remaining pairs of nodes are at a distance of at least 2 in Ai. The second inequality is 
due to 6(1) — c < 6(2). 

Inequality (l2^ means that by replacing Fi with T, we can increase the utility of network F which is a 
contradiction to the assumption that A is a BR to G. Therefore, no component of A contains nodes from multiple 
components in G and thus by Lemma the subgraph of A induced by Aj must be a BR to Gj for 1 < i < r, 
yielding the result. ■ 

We are now in place to prove Proposition |5] 

Proof: (Proposition [S]) 

Consider player Pj where 1 < j < m. If j > ii (i.e., Pj G Hq), then G =~ consists of disjoint 

complete graphs on node sets V)., A-i, • • • j Aq. Since 

kj P ki^+i > n — ii 

hj P ki^-\-\ P P ^2 


Jvj ^ ki^pY ^ ly —Z' 

kj PkiPir, 
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a best response of player Pj to any of these complete networks is the empty network (by Proposition [T] or Lemma |7]). 
Every induced subgraph of G has a star network on its non-empty components (which means it has a 2-spanner 
forest). Thus using Lemma the empty network Gj is a BR to the network of the other players. 

Next, we prove that for player Pj € Hi where 1 < ( < r — 1, the network Gj is a BR to the other players’ 
networks. From the definition of the sets Hi in ([T3]) . we have that ii+i < j < ii- Note that G =~ 
consists of disjoint complete graphs on node sets V;+i,..., l^.. It also has a component G = (uJ^Ql/j, Ec) of size 
n — ii^i- The structure of the network G can be described as a set of complete networks of size n — + — 

^2 -h 1,.. . , ii-i — ii + l,ii — ii+i where all of them have the common node Vj. These complete networks are on 
node sets Vq U {vj}, Vi U {vj },..., V;_i U {vj}, V/. Network G satisfies the condition of Lemma |9] and thus a BR 
to G can be obtained by finding a BR network to each component. Since 

kj P /cj,^2-1-1 ^ H+i *z+2 


kj P ^ P —1 P 

kj > ki > ir, 

the best response of player Pj to each of the complete networks on node sets V^+i,..., 14 in G is the empty 
network. 

Network G has a star subgraph centered at node Vj and hence by Proposition |3j there exists a BR network 
S = {u[^QVt, Es) that is a star network centered at node Vj with potentially some isolated nodes. Now assume 
that in the network S, there are edges from Vj to a nonempty strict subset of nodes Rg C Vg for some 0 < q < I, 
and the set of nodes in Vg \ Rg are isolated. Note that edges between node Vj and the set of nodes Rg are only 
useful for connections between nodes in Rg U {vj} and produces a utility of 

(^6,(1) - c,- + > 0, (25) 

where the inequality follows from the fact that this graph has utility at least as large as that of the empty network. 
Now construct a new network S' = (U^^q14, Es') by connecting a node u € Vg\Rg to Vj, i.e., Es' = EsLl{{vj,u)}. 
Then we have that u{S'\G) — u{S\G) = bj{l) — Cj + \Rg\bj{2) which must be a positive value by inequality (1251) . 
This contradicts the assumption that S' is a BR to G. Therefore, for each 0 <t < I, node Vj is either connected to 
all of the nodes in Vj or to none of them. Since 

kj < ki^ < n — ii + 1 
kj <ki^<ii—i2 + l 


kj < fcj, < ii_i -ii + l, 

a BR to all of the complete networks on nodes Vj U {r;j} in G is the star network for 0 < t < Z — 1. However, 
since kj > P ii — ii+i, the BR to the complete network on the set of nodes Vj is the empty network and 

thus all of the nodes in Vj \ {vj} must be isolated nodes. 

Therefore, we can conclude that a star network centered on the node Vj with peripheral nodes {uii+i ,... ,Vn} = 
UjZQVj, and all other nodes being isolated is a BR to the network of the other players; this is precisely the network 
Gj given in the statement of the proposition. 

Finally, we have to show that players Pj, l<j<P (i.e., Pj G Hr) are in Nash equilibrium. Similar to the above, 
for player Pj, G =~ consists of complete networks of size n — zi -|- 1,4 — Z 2 + 1, • • •, P-i —P + 1,4 

with the common node Vj. These complete networks are on node sets Vq U {vj}, Vi U {vj },..., Vj._i U {vj}, 14- 
By an argument similar to the above, since kj Pki>ir and 

kj <ki^ <n — ii + l 
kj <ki^<ii-i2 + l 

kj ^ ki^ ^ 4—1 4 T 1, 
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a star network centered on Vj with peripheral nodes = U[~q 14 (i.e., Gj) is a BR to the network of 

the other players. 

Therefore, for each player Pj € P, Gj is a BR to G =~ given networks are in Nash 

equilihrium. ■ 
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